A Speech Preprocessing Method Based on Perceptually Optimized Envelope Processing to Increase Intelligibility in Reverberant Environments
نویسندگان
چکیده
Speech intelligibility in public places can be degraded by the environmental noise and reverberation. In this study, a new near-end listening enhancement (NELE) approach is proposed which using time varying filter jointly enhances onsets reduces overlap masking. For optimization, some look-ahead clean speech prior knowledge of room impulse response (RIR) are required. method, optimizing defined cost function, Spectro-Temporal Envelope reverb optimized to as close possible that speech. with increased weight. This different from overlap-masking ratio (OMR) (OE) approaches (Grosse, van de Par, 2017, J. Audio Eng. Soc., Vol. 65 (1/2), pp. 31–41) only consider previous frames each slot for determining variant filtering. The SRT measurements show optimization framework up 2 dB more OE.
منابع مشابه
A preprocessing technique for improving speech intelligibility in reverberant environments: the effect of steady-state suppression on elderly people
In a large auditorium, perceiving speech may become difficult. One reason that reverberation degrades speech intelligibility is the effect of overlap-masking (Bolt and MacDonald, 1949; Nabelek and Robinette, 1978). Reverberation is a more critical issue for elderly people to perceive speech than it is for young people (Fitzgibbons and Gordon-Salant, 1999). Arai et al. suppressed steady-state po...
متن کاملModulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments
Most listeners have difficulty understanding speech in reverberant conditions. The purpose of this study is to investigate whether it is possible to reduce the degree of degradation of speech intelligibility in reverberation through the development of an algorithm. The modulation spectrum is the spectral representation of the temporal envelope of the speech signal. That of clean speech is domin...
متن کاملBinary Mask Estimation for Improved Speech Intelligibility in Reverberant Environments
A blind (non-ideal) time-frequency (T-F) masking technique is proposed for suppressing reverberation. A binary mask is estimated at each T-F unit by extracting a single variance-based feature from the reverberant signal and comparing its value against an adaptive threshold. The performance of the estimated binary mask is evaluated using intelligibility listening tests with hearing impaired list...
متن کاملPerceptually Inspired Signal-processing Strategies for Robust Speech Recognition in Reverberant Environments
Perceptually Inspired Signal-processing Strategies for Robust Speech Recognition in Reverberant Environments
متن کاملDesigning modulation filters for improving speech intelligibility in reverberant environments
In this paper, we propose a new technique to design modulation filters to reduce degradation of speech intelligibility in reverberant environments. Using the inverse modulation transfer function, we design data-derived modulation filters for each speech frequency band. These filters preprocess speech signals between a microphone and a loudspeaker that radiates speech into a performance hall. Us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app112210788